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EXAMINER'S ANSWER 

( 1) Real Party In Interest 

A statement identifying the real party in interest is contained in tlie brief. 

(2) Related Appeals and Interferences 

A statement identifying the related appeals and interferences which will directly 
affect or be directly affected by or have a bearing on the decision in the pending appeal 
is contained in the brief. The examiner is not aware of any related appeals, 
interferences, or judicial proceedings which will directly affect or be directly affected by 
or have a bearing on the Board's decision in the pending appeal. 

(3) Status of Claims 

The statement of the status of the claims contained in the brief is correct. 

(4) Status of Amendments After Final 

The appellant's statement of the status of amendments and final rejection 
contained in the brief is correct. 

(5) Summary of Claimed Subject Matter 

In the summary of the invention appellants have included a mapping between the 
specification, and claim limitations, that includes expanded notation and reference to 
Figures containing elements not specifically recited in the language of the claims, and 
not previously argued by appellants. The examiner notes that this mapping appears to 
offer an interpretation of certain limitations that is more specific than the claims actually 
require. 
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(6) Grounds of Rejection to be Reviewed on Appeal 

The appellant's statement of the grounds of rejection to be reviewed on appeal is 
correct. 

(7) Claims Appendix 

The copy of the appealed claims contained in the Appendix to the brief is correct. 

(8) Evidence Relied Upon 

- "Music Content Analysis through Models of Audition", Martin et al, ACM 
Multimedia Workshop '98, ACM 1 998 

- U.S. Patent 5,918,223 issued to Blum et al 

(9) Grounds of Rejection 

The following ground(s) of rejection are applicable to the appealed claims: 
Claims 1-20, 22-24, 26-27, 29-31 and 33-43 are rejected under 35 U.S.C. 
103(a) as being unpatentable over "Music Content Analysis through Models of 
Audition", Martin et al, ACM Multimedia Workshop '98, ACM 1998 in view ofU,S. 
Patent 5,918,223 issued to Blum et aL (Of Record) 

In order to clarify the examiners 103(a) rejection reasoning, a brief summary of 
the prior art and of the claimed subject matter wiii first be presented. 

Appellants have disclosed a computer method, program code, and database for 
modeling human perception of music by extracting numerical representations from the 
recordings, weighting the parameters, and adjusting the weighting to find a descriptor 
that most closely matches perceptions reported by human listeners. Appellants have 
also claimed creating and searching a database based on descriptors matching the 
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perception of human listeners. The examiner first notes that the claimed "extraction" 
(extractor), as disclosed in appellants' specification (page 8, lines 7-10) uses methods 
known in the art for the extraction of musical parameters fe.g. tempo, rhythm, etc.), in 
addition to being taught in the referenced prior art. Second, the claimed "weighting" to 
compute a number representing a "descriptor" for the recording, amounts to simple 
statistical weightinp where a factor assigned to a number in a computation (as in 
determining an average), to make the number's effect on the computation reflect its 
importance, (Wikipedia, for example) Hence, a skilled artisan would have known to 
apply "weighting" to the extracted parameters. Prior art Blum also teaches the use of 
such weighting techniques in order to emphasize perceptualiv important sections of 
musical sound (CL6-L40'43), Third, the claimed adjusting of the weighting based on 
the perception of human listeners is exactly what is suggested by prior art Martin, 
Namely, using the response of human listeners to identify musical parameters (page 7, 
para: 2-3, i.e, report human perceptions), the concept of modeling perception by 
building "statistical classifiers" (page 7, para:4), and the importance of "statistical 
equivalency" (page 4, para:7,8) in the classifiers. Statistical classification is a well- 
known statistical procedure in which individual items are placed into groups based on 
quantitative information (Wikipedia, for example), and again would have been 
knowingly implemented by a skilled artisan. 

The examiner has therefore maintained that appellants proposed "model of 
human perception of music" is at least conceptually disclosed in the teachings of Martin 



Application/Control Number: 09/556,086 Page 6 

Art Unit: 2128 

and Blum, and would have therefore knowingly been implemented by a skilled artisan 
having access to these prior art teachings. 

1 ) The Martin reference : The Martin reference is a research paper that conceptually 
proposes computer modeling in a musical multimedia system using human listener 
input to perform such tasks as; identify music genre, find similarities between 
pieces of music, identify musical parameters such as tempo and rhythm, and 
classifying music into categories, (page 7, para:2). Martin also introduces the 
concepts of using the "direct perception" of human listeners into a computational 
model (page 5, para:4, 9), and the concept of multimedia databases being 
accessed based on human perceptions of music (page 2, para:1), e.g. searching a 
database with descriptors similar to matches reported by human listeners. 

2) The Blum reference (US 5.918,223) : The Blum reference (of record) teaches a 
multimedia database system capable of the analysis and comparison of audio data 
files based on content where the analysis produces a numeric value (feature vector) 
that can classify and rank the similarity between individual audio files (Abstract). 
Blum further discloses the extraction of scalar descriptors that numerically describe 
recorded music, creating/searching a database of recorded audio data (Abstract, 
CL6'L12, CL6-L54, Figs. 1-5), and extracting multiple parameters from (n) number 
of recorded (electronic representation) audio files (CL7'L14'47, CL1 5''L29, Figs. 2, 

. 14) Blum also teaches statistical weighting of audio waveform sample (recorded) 
parameters to normalize sampled values of musical parameters (CL1 8-L20-45, 
CL14'L40-49). 
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Against this backdrop the examiner has rejected claims 1-20, 22-24, 26-27, 29- 
31 and 33-43 are rejected under 35 U,S.C. 103(a) as being obvious In view of "Music 
Content Analysis through Models of Audition", Martin et al, ACM Multimedia Workshop 
'98, ACM 1998 in view of aS. Patent 5,918,223 Issued to Blum et al as follows: 

Claims 1-20, 22-24, 26-27, 29-31 and 33-43 are rejected under 35 U.S.C. 
103(a) as being unpatentable over "Music Content Analysis through Models of 
Audition", Martin et al, ACM Multimedia Workshop '98, ACM 1998 in view of U.S. 
Patent 5,918,223 issued to Blum et al. (Of Record) 

Regarding independent claims t 3, 5, and 6 : Martin teaciies methods for 
building a computational model of human perception of music (page 1, para: 3-5, page 
7, parail) by extracting representations (page 6, para: 1-4, Abstract) of musical 
recording parameters (Le, parameters for at least rhythm and pitch). Most importantly 
Martin suggests that elements of a computational model of human perception of music 
should be based on the perceptions reported by a human listener (page 5, para:4, page 
7, para: 1-3), and that computational model descriptors of recordings should closely 
match perceptions as heard (reported) by a human listener (page 7, para:1, 2), Martin 
specifically sets forth that only a human listener can Identify genre" and realize "what 
other pieces or kinds of music it bears similarity to". (I.e. a music classification system 
and model must account for the fact that a sample piece of music can belong to one of 
several "classes" (genre) of music) Examiners' note/observation: The "extractor", as 
disclosed In appellants' specification (page 8, lines 7-10) uses methods known in the art 
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for the extraction of musical parameters, (e.g. tempo, rhythm, etc.) Martir) also 
introduces the concept of multimedia databases being accessed based on liuman 
perceptions of music (page 2, para:1), e.g. searching a database with descriptors 
similar to matches reported by human listeners, Martin further suggests that a human 
listener can perform such tasks as; identify genere of the music, identify what other 
pieces or kinds of music it bears similarity to, have emotional reaction to the music, and 
classify the music, (page 7, para:2) It should also be noted that, according to the 
specification, the claimed "descriptors" are represented using linear models (McCullagh 
and Nelder) that are again admittedly known in the art (specification page 13, lines 4-7). 

Martin does not explicitly disclose combining parameters to compute a descriptor 
or the use of parameter weighting. 

Blum et al discloses a multimedia database system capable of analysis and 
comparison of audio data files based on content where the analysis produces a numeric 
value (feature vector) that can classify and rank the similarity between individual audio 
files (Abstract). Blum further discloses the extraction of scalar descriptors that 
numerically describe recorded music, creating/searching a database of recorded audio 
data (Abstract, CL6'L12, CL6'L54, Figs. 1-5), and extracting multiple parameters from 
(n) number of recorded (electronic representation) audio files (CL7-L14'47, CL15'L29, 
Figs. 2, 14) Blum also teaches statistical weighting of audio waveform sample 
(recorded) parameters to compute a description (numerical) of the sample (Abstract, 
CL10-L67 to CL11-L45, Figs. 6-7), finding a set of numeric values (vectors, e.g a 
numerical discriptor) to classify the similarity between audio files (e.g. songs) in a 
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database (abstract), and a computed distance (difference) between audio files (CLS- 
L35-67). In statistics, "weigtiting" is a tectinique used to assure representation of 
certain groups in the sample. Data for underrepresented cases are weiglited to 
compensate for thieir small numbers, making the sample a better representation of the 
underlying population, (Source: "Statistical Methods", Freund, Academic Press, 1993) 
Hence "weighting" would have knowingly been incorporated by a skilled artisan, in order 
to provide a better representation of the descriptor parameters, and to "balance" the 
descriptors to match the human perception reported by listeners. Weighting techniques 
are also implemented by Blum (CL14-L41-65, Fig. 13) as noted above. Here amplitudes 
of sound files are weighted to cause statistical values to depend more on louder parts. 
The examiner also maintains that "combining" and "weighting" the descriptors from 
multiple samples of music would be necessary in order to determine the similarity of a 
particular piece of music with other genres as realized by Martin. 

It would have been obvious to one having ordinary skill in the art at the time the 
claimed invention was made to modify the teachings of Martin relating to a 
computational model of human perception of music based on perceptions of a human 
listener, with the teachings of Blum relating to extraction of scalar descriptors that 
numerically describe recorded music, creating/searching a database of recorded audio 
data, to realize the elements of the claimed invention. An obvious motivation exists 
since, as referenced in the prior art, only a human listener can "identify genre" and 
realize "what other pieces or kinds of music it bears similarity to". (See: Martin, page 7, 
para:1, 2). Accordingly, a skilled artisan tasked with realizing a system, method, and 
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database for music searching based on tiuman perception, and tiaving access to tlie 
teacliings of Martin and Blum, would have knowingly modified the teachings of Martin 
with the teachings of Blum (or visa versa) to realize the claimed elements of the 
present invention. 

Per dependent claims 2, 14-17 : Blum would obviously include a computer 
readable medium containing the computer program for performing the disclosed 
techniques relating to music perception and a database of music recordings (Fig, 1), 

Per claims 7, 10-13. 18-20, and 26 : As cited above, Blum teaches a method and 
system for creating and searching a database of data records which are associated with 
music recordings. (Fig. 1) The method and system are based on a model formed from 
the perception of the music inclusive of extracting numeric parameters from an 
electronic representation of musical recordings. Blum also considers the likeness (i.e. 
similarities) between the extracted representation of the various musical recordings, 
extracting numeric parameters (i.e. descriptors) from recordings by use of weighting 
parameters (CL17-L7 to CL18-L43), and computing (calculate) the correlation between 
recorded sections (i.e, the stored numerical descriptors). Blum also teaches identifying 
data records associated with a music recording in a computer readable database 
(CL21-L53 to CL26-L10) based on numerical parameters (descriptors) describing the 
music. Martin also at least conceptually teaches multimedia databases being accessed 
based on human perceptions of music (page 2, para:1), e.g. searching a database with 
descriptors similar to matches reported by human listeners. Hence, using the same 
motivational reasoning set forth above, the combination of Blum and Martin renders 
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obvious adjusting ttie weighting based on tiuman perception and using a iiuman's 
perception of a sound source in modeling tfie effect on tiie descriptors (parameters 
describing tlie music and recording database) oftlie music as would be perceived by 
human subjects. 

Per dependent claims 4. 8, 9, 22-24, 30-3 1 33 : Blum would obviously include a 
computer readable medium containing the computer program for performing the 
disclosed techniques relating to music perception and a database of music recordings 
(Fig. 11 

Per dependent claims 27, 29. 34-43 : This group of claims merely require that 
groups of at least two numeric parameters from well-known musical attributes relating to 
dynamic range, loudness, harmony, rhythm, attack, tempo, note duration, key, etc. be 
selected. (See: Blum Figs. 2-14) 

(10) Response to Argument 

Appellants argue that the rejection has failed to show every feature of the claims 
and have focused on six specific issues as not being taught in the prior art. 

In the final office action of July 7, 2007 pages 2-6 the examiner presented a 
detailed analysis of the rejection reasoning relating to the obviousness of the claimed 
subject matter relative to the prior art. 

In a nutshell, the examiner submits that the underlying fundamental concept 
disclosed in both the present invention, and the prior art Martin, is the use of the results 
of a human listeners perception of music to statistically "weight in" on the model, and 
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use the "humanized" model for searching and archiving music in a musical multimedia 
system and database. Appellants have simply failed to embrace the teachings of the 
prior art references in their entirety, and have instead focused their arguments on 
features in the prior art that art not related to this fundamental concept. 

Issue i : Appellants main thrust is arguing that Martin does not teach a "method 
for building a computational model of human perception of music" as recited in claim 1. 
In response, the examiner submits that it is the combination of Martin and Blum that 
renders the claimed model obvious. Not simply Martin as asserted by appellants. 

As previously set forth in the final office action, the examiner submits that 
applicant's "computational model of human perception" as presently claimed, simply 
consists of three method steps. For example, referring to independent claim 1 they are: 

1) extracting two numeric parameters (min.) from five musical recordings (min.) 

2) combining numeric parameters with weighting forming a single number descriptor 

3) adjusting the weighting to find set matching perceptions reported by human listeners. 

The examiner maintains that these broad limitations are rendered obvious by the 
prior art for the following reasons. 

First, Martin clearly sets forth that the research disclosed in the reference is 
directed toward the construction of a "model" of "human music perception " (page 7, 
para:1) including using the response of human listeners to identify musical parameters 
and "classify" the music (page 7, para: 2-3, i.e. report human perceptions). Martin also 
introduces the concept of modeling perception by building "statistical classifiers" for 
evaluating musical "properties" and making "musical judgments" (page 7, para:4). 
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As to the first method step, the claimed extracted two "numeric parameters" are 
simply any two musical parameters including rhythm, tempo, loudness, and harmonic 
content (Specification 4,2.4). These elements are taught by both Martin (page: 2) and 
Blum (Figs. 5-1 ) It should also be noted that the "extractor", as described in appellants' 
specification (page 8, lines 7-10) is disclosed as using methods that are admittedly 
known in the art for the extraction of musical parameters, (e.g. tempo, rhythm, etc.) 

Considering the second method step, the claimed combining numeric parameters 
with weighting forming a single number descriptor simply amounts to simple statistical 
weighting. In statistics "weighting" is simply a factor assigned to a number in a 
computation, as in determining an average, to make the number's effect on the 
computation reflect its importance. (Wikipedia, for example) Blum teaches the use of 
such weighting techniques in order to emphasize perceptually important sections of 
musical sound (CL6-L40'43), finding a set of numeric values (vectors, e.g a numerical 
discriptor) to classify the similarity between audio files (e.g. songs) in a database 
(abstract), and a computed distance (difference) between audio files (CL3'L35-67). 
Hence a skilled artisan would have known to "weight" the extracted numeric parameters 
into a single "weighted" representation of the numerical parameter (i.e. a single number 
descriptor) in order to emphasize its computational importance. Martin also sets forth 
the concept of modeling perception by building "statistical classifiers" (page 7, para:4), 
and the Importance of "statistical equivalency" (page 4, para:7,8) in the classifiers. It 
should also be noted that, according to the specification, the claimed "descriptors" are 
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represented using linear models (McCullagh and Nelder) that are again admittedly 
known in the art (specification page 13, lines 4-7), 

As to the final method step of adjusting the weighting to find a set matching 
perceptions reported by human listeners, Martin sets forth using the response of human 
listeners to identify musical parameters and the concept of modeling perception by 
building "statistical classifiers" as noted above. Statistical classification is a well-known 
statistical procedure in which individual items are placed into groups based on 
quantitative information. (See: Wikipedia encyclopedia, for example) In this case, the 
groups are simply the set of weightings matching perceptions reported by human 
listeners. That is, it would appear that Martin, at least conceptually, sets forth the idea of 
using statistical classification with the perception of music reported by human listeners 
in constructing a model of human perception of music (e.g. by "comparison to human 
listening behavior and judgments" (Martin: page 4, para:1)), as does the claimed 
invention. (Also see response to Issue 4) 

Issue 2 : Here appellants appear to again argue that Martin does not teach a 
"method for building a computational model of human perception of music" and that 
Martin lacks the capability to automatically detect genre. Since the computation model 
has already been discussed above, the examiner addresses only the second issue of 
automatically detecting genre. In response the examiner first submits that appellants 
have not specifically claimed the ability to detect genre in the language of the claims. 
Hence, the argument is more specific that the claims require. Second, the only 
mechanism for the detection of genre in the present invention appears to be based on 
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the perceptions reported by human listeners , (specification pages 19-20, e.g scoring 
similarity) This is exactly what is suggested by Martin, Namely that a human listener 
can perform such tasks as; identify genere of the music, identify what other pieces or 
kinds of music it bears similarity to, have emotional reaction to the music, and classify 
the music (page 7, para:2) (page 7, para:2) 

Issue 3 : Motivation to combine the references. The examiner maintains the 
motivation of analogous arts Martin and Blum as cited in the final office action of July 7, 
2006. However, the examiner further notes that there are numerous other motivational 
reasons disclosed in Martin. These include the desire to give computational systems 
"human-like listening skills" (page 2, line 1), and the desire interact with based on user 
(human-like) interests (page 2, para:2). Here the prior art both suggests, and provides 
reasoning by way of examples, of the benefits of including human-like perception in a 
multimedia music system and database. That is, Martin specifically suggests the 
advantages of providing a "humanistic" machine model (e.g. by "comparison to human 
listening behavior and judgments" (Martin: page 4, para:1)), in a multimedia system 
such as Blum. (Martin: page 2, para:1&2) 

Issue 4 : Here appellants appear to argue that the prior art does not disclose the 
claimed feaures relating to "find a set of weightings where each computed descriptor 
(difference number) for each recording (pair) most closely matches perceptions reported 
for the recording by one or more human listeners", "combining the numeric parameters 
with a weighting for each parameter to compute a single number representing the 
descriptor for that recording", and as recited in the claims. 
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In response the examiner submits that in order to find the similarity (likeness) 
between recordings, based on observations made by human listeners, it is obviously 
necessary to statistically balance (i.e. weight), in order to have the descriptor (or 
difference number) accurately "match" the perception of a hurhan listener. This would 
have been known to one of ordinarly skill in the art as noted above. The advantages of 
weighting are realized by Blum where weighting is applied in order to emphasize 
perceptually important sections of musical sound (CL6-L40'43). The use of statistical 
equivalency (e.g. weighting) is also suggested by Martin (page 4, para:7) as noted 
above. The method of initially representing descriptors (specification page 13, line 2 
"McCullagh and Nelder'% and humanly "rating" the descriptors (specification page 9, 
line 24, "Lickert scale") in the present invention is admittedly known in the art. The 
examiner therefor submits that a skilled artisan, motivated as noted above, would have 
knowingly applied (found sets of) weighting for descriptors and difference number, in 
order to emphasize (statistically balance) computational importance of the perception of 
human listeners in the model. 

Issue 5 : Here appellants argue that the prior art does not disclose the claimed 
feature of "assembling the computed difference numbers into a database where each 
computed difference is associated with the identifier for each of two recordings for which 
the difference was computed". As previously noted, Martin suggests that a human 
listener can perform such tasks as; identify genere of the music, identify what other 
pieces or kinds of music it bears similarity to, have emotional reaction to the music, and 
classify the music (page 7, para:2). Appellents proposed "likeness model" (specification 
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4,5) is simply based on the concept of human listeners rating (judging) the similarity 
(likeness) of pairs of songs to form a new descriptor, (e.g. rating pieces of music for 
similarity, as suggest by Martin) Blum discloses finding a set of numeric values (vectors) 
to classify the similarity between audio files (e.g. songs) in a database (abstract) and a 
computed distance (difference) between audio files (CL3-L35-67). 

Looking into appellants specification (sections 4,4,3 and 4.5,3) gives only a 
vague discription of the proposed database oganization. There appears to be no 
specific description of the claimed Identifier". The examiner therfor interprests the 
identifier as simply a similarity identifier. At best, the proposed database appears to 
simply be a multi-dimensional (hierarchical) arrangement based on similarity, Blum 
discloses difference identifiers defining the similarity between audio files. (Abstact) 
Hierarchical databases are very well-known and commonly employed in the art for 
logically a forming a database structure based on a relationship between the data. 
(Microsoft Computer Dictionary, 1997) Hence a skilled artisan would have knowingly 
created a heirarchical database arrangement based on similarity as disclosed in Blum. 

Issue 6 : Here appellants argue that the prior art does not disclose the claimed 
feature of "searching a database containing computed difference numbers between the 
traget recording and a plurality of other recordings for those recordings which a small 
computed difference number from the target music record". Appellants also argue that 
the distance seaching of Blum is not the same as "searching a database" as recited in 
the instant claims. In response the examiner again notes that appellants specification 
(section 4.5.6) gives only a vague discription of the proposed database searching 
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technique. As best understood from the brief descripion in Section 4.5.6, the database 
searching is simply based on lil<eness (cioseness in similarity to a target), and involves 
returning a likeness list to the user. (Specification 4.5.6) Blum discloses searching a 
database and returning ranked list of similar (like) sound files (CL3-L34'39), and 
computing distance to a "target" sound file based on a numerical distance (CL22-L1'20, 
CL24-L6). 

The examiner therefor maintains that the claimed subject matter as disclosed by 
appellants is rendered obvious by the combination of Martin and Blum. 

(11) Related Proceeding(s) Appendix 

No decision rendered by a court or the Board is identified by the examiner in the 
Related Appeals and Interferences section of this examiner's answer. 

For the above reasons it is believed that the rejections should be sustained. 



Respectfully submitted, 
Fred Ferris 

Primary Examiner, AU 2128 



Richard A. Gollhofer 
Reg. No. 31,106 
Staas & Halsey LLP 
Attorney for Applicant(s) 
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